Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
upwork.com π’ 2026-05-10
πΉ Automated Real Estate Data Pipeline Developer
π€ Client: πΊπΈ USA Member since 2025-03-12
π° Price: ****
π© Problem: Develop and maintain an automated system to collect, organize, and deduplicate public-record property data.
π¦ Existing: Not specified
Specifications:
[Target] Indianapolis, IN, Kansas City, MO, Cincinnati, OH
[Method] Python-based automation with Playwright/Selenium/Scrapy for web scraping
[UI/UX] Not applicable (backend-focused)
[Stack] Python, Playwright, Selenium, Scrapy, BeautifulSoup, Cloud-hosted data pipelines
[Security] Compliance with public data access policies and legal requirements
[Format] Clean datasets in Excel or Google Sheets
Workflow:
1. Define target ZIP codes and sources for property data collection.
2. Develop web scraping scripts using Python libraries (Playwright, Selenium, Scrapy) to extract relevant data from county/government websites.
3. Implement deduplication logic to ensure unique records in the dataset.
4. Create structured datasets in Excel or Google Sheets format for seller leads and buyer activity tracking.
5. Set up cloud-hosted automation workflows for recurring data collection and organization tasks.
6. Integrate basic logging/error reporting mechanisms for system monitoring.